中国科技核心期刊

中文核心期刊

CSCD来源期刊

空间控制技术与应用 ›› 2021, Vol. 47 ›› Issue (6): 52-58.doi: 10.3969/j.issn.1674 1579.2021.06.007

• 论文与报告 • 上一篇    下一篇

基于深度强化学习的机器人推拨优化装箱问题研究

  

  1. 上海交通大学
  • 出版日期:2021-12-25 发布日期:2022-01-20
  • 基金资助:
    国家重点研发计划资助项目(2018AAA0102700)

Optimization of Robotic Bin Packing via Pushing Based on Algorithm

  • Online:2021-12-25 Published:2022-01-20
  • Supported by:
    the National Key R&D Program of China(2018AAA0102700)

摘要: 三维装箱问题是指在满足容积限制、稳定性限制等条件下,将一定数量的物体放入较大容量的箱子中并使空间利用率最大的组合优化问题.三维装箱问题是典型的NP完全问题,通常采用启发式算法规划物体放置的位姿.在使用机器人完成装箱任务时,还要额外考虑机器人操作限制,如机械臂或末端执行器与物体或箱子之间的碰撞、机械臂运动轨迹的规划等,使得部分最优位姿不可行,只能将物体从更高处落下或者将物体放在最优位姿的附近.机器人在抓取、识别和放置时的不确定性也会导致最终放置位置与规划产生偏差.因此,本文提出基于深度强化学习的机器人三维装箱推拨优化方法,以最小化包装箱中物体放置位置的启发式算法分数为目标,通过推拨动作对于已放置的物体位置进行调整、归集,将物体朝角落中压缩,以腾出更多空间,提高装箱空间利用率,减小由于机器人操作不确定性对装箱结果的影响.

关键词: 强化学习, 深度学习, 三维装箱, 机器人推拨

Abstract: 3D bin packing problem is a combinatorial optimization problem that needs packing a certain number of objects and maximizing the volume utilization under the constraints of volume limit and stability limit. 3D packing problem is a NP hard problem. Heuristic algorithm is usually used to find the best position to place the object. When robot is used for packing, manipulation uncertainties should be handled. For example, the collisions between the manipulator and the surroundings, and the planning errors of the manipulator motion trajectories may make some optimal poses infeasible. Thus, the object can only be dropped from a higher place or placed near the optimal pose. The uncertainties of robot in grasping, recognition and placing also lead to the error between the real object position and the planned one. Therefore, an optimization method for robot 3D packing via pushing is proposed based on deep reinforcement learning. Aiming at minimizing the score of the heuristic algorithm for the positions of objects in the bin, robot can reorganize the positions of placed objects via pushing. Meanwhile, the objects are compressed towards a corner to make more space and improve the volume utilization rate of packing.

Key words: reinforcement learning, deep learning, 3D bin packing, robot push

中图分类号: 

  • TP242.2